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While there is ample evidence that social and communication networks play a key role during 
the spread of new ideas, products, or services, network effects are expected to have diminished 
influence in the stationary state, when all users are aware of the innovation, and its usage pattern 
is determined mainly by its utility to the user. Here we study four mobile phone-based services 
available to over six million subscribers, allowing us to simultaneously monitor the communication 
network between individuals and the time- resolved service usage patterns. We find that usage highly 
correlates with the structure of the communication network, and demonstrate the coexistence on 
the same social network of two distinct usage classes, network effects being responsible for the 
quantifiable differences between them. To test the predictive power of our theory, we demonstrate 
that traditional marketing techniques are ineffective in permanently boosting service adoption, and 
propose a hub-based incentive mechanism that has the potential to enhance usage for one of the 
two service classes. 



Thanks to electronic communications that offer virtu- 
ally instantaneous information access, awareness of some 
products and services, such as iPod, MySpace, RAZR, 
and ringtones, have swept the population within a few 
months, resulting in millions of active users. Innova- 
tion diffusion studies offer a rather complete conceptual 
picture of this adoption process, capturing how an in- 
novation spreads from early adopters to the laggards 
along the links of the social and communication network 
El, S, H, 0, H, @1- Yet, all innovations eventually reach a 
saturated state, in which there is complete awareness of 
its benefits, and practically all those who could find the 
innovation useful have adopted it. The social network, 
that plays a key role during the spreading process 0, 
is not expected to impact this saturated state, as only 
the utility of the product or service to the individual 
user determines its day-to-day usage. This leaves open a 
question of major conceptual and practical importance: 
could the underlying social and communication network 
play a role in the usage of innovations? Here we show 
that this is indeed the case: the ties between individuals 
significantly impact the usage patterns of many services. 

Today our ability to study innovation diffusion and 
usage patterns is enhanced by the fact that virtually all 
electronic communications and purchasing events leave 
digital signatures [1, S Yet, advances are still hm- 
ited by the fact that we need to simultaneously track 
both social interactions and service usage patterns, which 
requires the merging of various proprietary databases, 
a procedure limited by significant technical and legal 
roadblocks, as well as legitimate confidentiality concerns. 
Given that mobile carriers must track for billing purposes 
each call users place or receive, effectively recording the 
underlying social network, and regularly introduce in- 
novative phone-based services, they offer unprecedented 
opportunities to explore the detailed dynamics of service 
usage jTl| . Here we capitalize on this opportunity, study- 
ing the role of the social network in the usage patterns 



of four mobile phone-based services, available to over a 
quarter of mobile phone users in a country. We first show 
that service usage is characterized by detectable dyadic 
effects, indicating that if a user uses a specific service in a 
regular fashion, her phone-based contacts will also use it 
regularly. Yet, despite this dyadic effect, we find that the 
four services cluster into two distinct classes, document- 
ing the coexistence on the same social network of two 
rather different usage patterns. We develop an analytical 
theory, rooted in diffusion models used re gula rly to model 
spreading effects in physical sciences jTl,lllj3and the 
spread of computer and biological viruses IsT 16l , [l7j to 
predict the role of the social network in service usage, and 
demonstrate that it accurately describes the observed us- 
age dynamics. To test the predictive power of the pro- 
posed quantitative framework, we explore the impact of 
advertising on service usage, showing that for the stud- 
ied services advertising has no long-term impact, a pre- 
diction that is directly supported by real data. Finally, 
we propose that hub-based incentives could be used to 
permanently boost service usage, but they are effective 
for one service class only, illustrating the practical im- 
portance of identifying the right class to which a service 
belongs to. 

The dataset studied by us records all calls initiated by 
over 6 million customers of a mobile phone company, al- 
lowing us to represent each phone number, replaced by a 
hash code to guarantee anonymity, as a node in the com- 
munication network. The undirected links between users 
represent voice calls or short messages, the two most 
widespread mobile phone-based communication services, 
resulting in approximately 10 million links each month. 
The call data spans a period of over a year, and is aggre- 
gated into intervals of two weeks, each record identifying 
the call's origin, destination, and the type of service used 
by the user. We focus on four services: Email, whose 
users can send and receive emails from their cell phones; 
WAP, that gives the user the opportunity to browse Web 
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sites; Chat, that allows users to exchange short messages 
anonymously; and MyFriends, which allows a group of 
registered users to contact all other members of the group 
at once. 

To characterize the service usage we first measured the 
number of active users for each service for over a year- 
long period. We find that the number of users who have 
used the service at least once from the beginning of our 
observational period increases monotonically (Fig. la- 
d), suggesting the presence of a classical adoption pro- 
cess, during which each month new users adopt each ser- 
vice. Yet, the number of individuals who use the ser- 
vice in a two-week interval is largely independent of time 
(Fig. la-d). Therefore, these services have swept the 
current user base and reached a saturated or endemic 
state before the beginning of our observational period. 
The increasing cumulative user number is not an evi- 
dence of a continued adoption process, but indicates that 
the saturated state hides a highly dynamic equilibrium, 
where the adopters use each service only sporadically. In- 
deed, the percentage of users that use the same service in 
two consecutive months is relatively small (37% for Chat, 
30% for Email, 47% for WAP, and 33% for MyFriends). 
Thus, in contrast with the traditional subjects of innova- 
tion diffusion studies, where once an individual accepts 
an innovation she is considered an adopter for the rest of 
the study, mobile phone-based services display a highly 
volatile usage pattern, with users entering and exiting 
the user base in large numbers in each two week period 
p^ . This may be rooted in the fact that these services 
are billed only for the air-time occurred during their us- 
age, thus turning the specific service on or off does not 
require additional financial commitments. Thus the only 
lock-in incentives are utility or network-based: a service 
may be indispensable for an individual or there may be 
network externalities, resulting from peer pressure by so- 
cial contacts towards its use. 

The fundamental hypothesis of all adoption studies 
is that information externalities play a crucial role in 
the spreading process: a given innovation spreads along 
the social links through advice, influence, or persuasion 
[l|, [3, Q. Yet, in the saturated state, when the user base 
is largely aware of the benefits of an innovation, social 
ties cease to exert a role as information source, thus us- 
age should be mainly utility-based. To establish if per- 
sonal contacts affect the usage pattern we determined the 
probability pi that two users that communicate with each 
other use the same service Si . By itself pi is not informa- 
tive, as with probability pf^'^'^ = pf any two individuals 
may also use the same service, where pi represents the 
fraction of mobile phone users that adopted Si. Thus 
a better measure of the role of personal recommenda- 
tion is pil p^^^^. We find that for Chat and MyFriends 
Pilp^""^^ > 10, while for Email and WAP p./p'^'"^ « 2 
(see Fig. 7 in the Supporting Text), evidence of a dyadic 
effect. Therefore, users that know each other are more 
likely to use the same service than expected based on a 
random adoption model. Thus the impact of the com- 



munication ties is not limited to the adoption phase, but 
it also affects the day-to-day usage of services. 

To check if service usage shows effects beyond the 
dyads, so that most members of a community tend to 
adopt the same service, we determined the fraction pf (d) 
of users of service Si among individuals d links away from 
a user of Si. For reference we also measure the fraction 
pf {d) of users in the vicinity of an individual that has 
not adopted Si. If adopters do not segregate into com- 
munities, then pY (d) = pf{d). Indeed, we find that for 
Email and WAP pf (d) and pf{d) largely overlap for most 
d values (Fig. 1 e and /), and the observed difference be- 
tween them at small d is relatively small, suggesting ei- 
ther the absence of compact user communities or at most 
weak local aggregation of users. In contrast, for Chat and 
MyFriends pf (d) is one to two orders of magnitude larger 
than pf{d) for small d (Fig. [1] 17 and h), indicating that 
the vicinity of a user has a significantly higher density of 
users than the vicinity of a non-user. To illustrate the 
presence or absence of a community-based segregation 
we visualized the adoption patterns in selected neighbor- 
hoods for Email and Chat. We find that Email is used 
uniformly across communities, with no apparent corre- 
lation between the underlying community structure and 
service usage (Fig. 2a). In contrast. Chat users have 
the tendency to form compact communities, so that in 
a given community either almost everybody or almost 
nobody uses Chat (Fig. 2b). 

If service usage is driven by communication with so- 
cial contacts, high-degree individuals should more likely 
use a given service than small-degree individuals. While 
the role of hubs and connectors in innovation diffusion 
is well documented [l|, 0, [1, [l^ , in the absence of suffi- 
cient statistics the functional dependence of the adoption 
rate on the degree remains unknown. Furthermore, it is 
unclear if such degree-dependent effects should emerge 
in the usage patterns as well. Therefore, we determined 
the usage rate among users with degree k, finding that 
for each service the probability that an individual uses a 
given service increases with her degree (Fig. li-l). For 
example, the likelihood that an individual with 100 con- 
tacts uses Chat is more than two orders of magnitude 
higher than for an individual with one or two contacts. 

In summary, the empirical results indicate that a com- 
mon feature of all four services is that they reached 
a highly dynamic saturated state, characterized by a 
degree-dependent usage. Yet, there are also significant 
differences between them, suggesting the existence of two 
adoption classes: Chat and MyFriends display a strong 
community-based segregation and high dyadic effects, in 
contrast with Email and WAP, that are characterized by 
weaker dyadic effect and the apparent absence of com- 
pact user communities. Thus for each service we observe 
correlations between the structure of the underlying com- 
munication network and the likelihood of service usage 
by individual users. 

A methodological challenge in understanding service 
usage is that we do not know why an individual uses a 



3 



specific service: is she persuaded by an acquaintance, or 
influenced by non-social factors, such as service utility, 
an advertisement, or a newspaper article? Furthermore, 
an individual can be persuaded to use a service by an 
acquaintance that she never contacts through her mobile 
phone. To account for these ambiguities, we grouped all 
individuals in two categories: connected ("c") individu- 
als are those that are in phone contact with at least one 
user of service Si, thus have a chance to be influenced by 
an acquaintance, and isolated ("«") individuals represent 
those that have no user of Si among their phone contacts, 
thus can be influenced only by external factors, such as 
mass media or acquaintances they do not call (20|. Fur- 
thermore, we will call an individual a user if during a 
one month interval she used service Si, and a potential 
user if she did not use the service during the month-long 
period, but could potentially become a user in the fu- 
ture. Therefore, we distinguish between four categories 
of individuals, isolated and connected potential users, as 
well as isolated and connected users, the number of k- 
degree individuals at time t in each category being given 
by Nl{t), N^{t), Ul{t) and U^{t), respectively. For a 
given service the changes in the number of isolated and 
connected adopters with degree k can be written as 



dt 
dt 
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where /ifc is the rate at which a potential user becomes 
an user due to external influence, and rjk (^fe) represents 
the rate at which isolated (connected) users stop using 
the service [2l|, going from users to potential users. For 
connected potential users Afc gives the rate at which they 
turn users as a result of the influence of one of their 
user contacts, and Q(t) is the probability that a user's 
randomly chosen link points to a user, given by [l6l . 
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(k) NP{k') 
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where the first term in the product is the probability of 
finding a degree-fc' individual at the end of a randomly 
chosen link if the degree distribution of the communica- 
tion network is P{k), and the second term is the probabil- 
ity that this individual is an user, Uk'{t) = Ul,{t)+U^,{t), 
and N is the number of individuals in the network. The 
time evolution of the number of potential users is given 
by 



dt 
dt 
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dt 
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where Ilk{t) is the rate at which isolated potential users 
acquire user neighbors, becoming connected. If degree 



correlations are absent, we can assume that any non- 
user neighbor with degree k' turns user at the same rate 
where Nk'{t) = Nl,{t) + N^^,{t). Accord- 
ingly, for isolated potential users with degree k 



Ilk{t) = Nlit)kY, 



k'P{k')dUk'{t) 1 

(fc) dt Nk'{ty 
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Next we show that Eqs. ((I])-(l3]) with the closures pro- 
vided by ([HI)-® accurately predict the dynamics of the 
observed service usage pattern. 

The input of (H]) and ^ are the time-independent 
adoption functions /ifc, rik, Xk, and ^fc, which quantify 
the response of the user base to the service. These 
adoption functions were measured using a self-consistent 
method described in the Supporting Text, and are shown 
in Fig. 2c-j. The measurements were performed for two 
intervals ten months apart, finding in both cases the 
same functional form (see the filled and empty symbols in 
Fig. 2c- j), supporting our starting hypothesis that the 
adoption functions are time-independent. To test the 
predictive power of the proposed analytical model, we 
numerically integrated Eqs. ([I])-©, starting with the ini- 
tial condition in which no users are present at time t — Q. 
The integration predicts that the number of users rapidly 
increases followed by saturation (Fig. 3a-rf). Most im- 
portant, in the saturated state ([T|) and ^ reproduce not 
only the total number of users, but also the detailed k- 
dependence of all user categories (Fig. ie-h). The inspec- 
tion of the adoption functions and the analytical model 
suggests the existence of two classes of services, that we 
discuss separately below. 

Individual Services: The most striking common fea- 
ture of Email and WAP is the fact that for both services 
Mfe ^ ^k (Fig- 2 c and d), i.e. external influence, such as 
mass media and advertisements, dominate over personal 
persuasion. Furthermore, for these two services rik ~ (,k, 
indicating that there is little difference in the probability 
that an isolated or a connected user suspends the service. 
For ^k ^ Xk and T]k = S,k Eqs. (U)-© reduce to 



dUkjt) 
dt 



= p^k[NP{k)-Uk{t)]-TikUk{t), (7) 



where Uk{t) denotes the total number of users with de- 
gree k. In the stationary state Eq. ([7]) predicts that 



Uk{t (X)) 
NP{k) 



1 



1 + m 



(8) 



i.e. the adoption rate is determined only by /i^ and rjk. 
As shown in Fig. 1 i and j, the prediction ([8]) is in good 
agreement with the measured /c-dependent adoption rates 
for Email and WAP. 

Cooperative Services: In contrast with the individual 
services, for Chat and My Friends Xk ^ Mfcj so the influ- 
ence of social contacts is more important than external 
influence. Furthermore, for these services rjk £,k (Fig. 2 
i and j), implying that isolated users are more likely to 
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abandon the service than connected ones. Thus Eq. ([8]) 
is not expected to hold, but the integration of (HI)-® 
correctly reproduces the /c-dependent adoption rates ob- 
served in the real data (Fig. 3 g and h). The emergence of 
communities observed for Chat and MyFriends is also a 
consequence of the functional form of the adoption func- 
tions. Indeed, with high A^, if an individual has two or 
more user neighbors, his or her likelihood of adoption in- 
creases, enhancing the pressure on the members of the 
same community to adopt the same service. At the same 
time 77fc 3> £,k leads to a preferential discontinuation by 
isolated individuals, leading to the observed virtual ab- 
sence of users in communities in which users do not domi- 
nate, and leaving usage intact only in communities where 
most individuals use the same service. 

Therefore, the empirically observed two distinct usage 
patterns can be reduced to the relative strength of Afc 
and /ife, quantifying the user base's attitude to a specific 
service. Indeed, Chat and MyFriends facilitate alterna- 
tive ways of communication between individuals, thus 
they are subject to network externalities [13, H^: the 
value of a service to a given user increases if her acquain- 
tances also adopt the service, responsible for the observed 
Afe 3> relation. Isolated users do not experience such 
peer pressure, thus are more likely to drop the service 
than users surrounded by users, responsible for the fact 
that ryfc » ^fc. In contrast, we find that Email and WAP 
have no immediate impact on the user's social contacts. 
This is obvious for WAP, whose main purpose is to pro- 
vide Web access, thus does not require the cooperation of 
other users to be useful. The fact that Email is an indi- 
vidual service is highly paradoxical, however, given that 
Email is a communication service, just like Chat. There 
is a major difference, though: Chat is an entirely phone- 
based service, thus it is useless unless the phone contacts 
of a user also adopt it. In contrast, for Email the mo- 
bile phone serves only as an interface, which allows the 
user to communicate with all those who use electronic 
mail, either using the phone or a computer (which is the 
far most common case) to access it. Thus there is no 
need for all phone-based acquaintances of a user to adopt 
Email in order to make it useful for an individual phone 
user. Therefore, when it comes to mobile phone usage, 
Email is not subject to network externalities ^2^,12^ and 
increasing returns [24t], having the characteristics of an 
individual rather than a cooperative service. 

While the continuum theory proposed above provides 
an accurate description of the observed usage patterns, 
it is of value only if it offers predictions of practical im- 
portance. For many services and products that are bene- 
ficial to a user base, a question of paramount importance 
emerges: how can one enhance service usage? The stan- 
dard ways are advertisements and incentives, increasing 
awareness of the service and boosting the user base. Such 
measures are costly and therefore can be used only for a 
limited duration. They are fueled by the belief that once 
users become familiar with a service, they will likely con- 
tinue its usage. Yet, next we show that in the saturated 



regime such approaches are ineffective: once the incen- 
tive is removed, the number of users returns to the pre- 
incentive value. Indeed, Eq. ([H]) predicts that the num- 
ber of users depends only on rjk and /i^ for individual 
services. Incentives can temporarily increase the adop- 
tion rate (/ifc) and decrease the dropout rate (77^, ^fe), but 
once they are discontinued, the number of users {Uk{t)) 
returns to their pre-incentive value. To see if this result 
applies to all services, we used numerical simulations (see 
Supporting Text) with the adoption functions of Fig. 2c-j 
as input, finding that if we artificially doubled the num- 
ber of users, the usage returns to its original value for 
both individual and cooperative services (see Fig. 4a). 
The final evidence comes from two services, providing 
collect calls and text message storage, available to the 
same user base as the four services studied above. These 
services were promoted by campaigns that started after 
they have reached saturation. As Fig.|3]c and d show, the 
campaigns have resulted in more than a five-fold increase 
in the number of users, but both services returned to 
their saturation value once the campaigns ended. Taken 
together, we find that once the saturated regime has been 
reached, traditional methods for boosting the consumer 
base are only effective for a limited time. Yet, next we 
show that for cooperative services one can exploit the in- 
timate knowledge of the communication network to de- 
sign cost-effective incentives. 

Highly connected individuals or hubs exert dispropor- 
tional influence over the user base [25|. Therefore, if each 
user with degree exceeding kc is given permanent free us- 
age of service Si, increasing their likelihood of adoption 
(fik) and a decreasing their dropout rate {rjk and ^fc), such 
differentiated incentives will result in higher saturation 
values (see Fig. 4 e and /). Indeed, those who received 
the incentive will increase usage, an effect that applies 
to both individual and cooperative services. However, 
due to the significant A term for cooperative services, 
increased usage by hubs will also enhance the usage of 
their acquaintances, an effect which will be negligible for 
individual services. The question is, will this second, per- 
suasion induced effect make the incentive potentially cost 
effective, recovering the cost of providing free permanent 
usage for all hubs? To answer this we determined the im- 
pact of the incentive by calculating the ratio between the 
number of new users (the difference A between the pre- 
and post-incentive saturation values) and the number of 
hubs Nh{k > kc) that have been provided the incentive. 
As Fig. 46 shows, we find that the impact increases with 
kc, indicating that the larger the degree of the hubs the 
incentives were given to, the larger their potential impact. 
For individual services (A = 0), however, the impact is 
negligible and has only weak dependence on kc (Fig. 2^)). 
If one factors in the potential cost of the incentives, we 
find that hub-oriented permanent incentives could bene- 
fit cooperative services, given that for these the impact 
increases with kc, thus for any incentive cost there will be 
a kc value for which the lost revenue on the hubs can be 
recovered by the increased revenue from the users that 
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have adopted under the influence of the hubs. However, 
for individual services a hub-based incentive would likely 
be a losing proposition: the revenue lost from the heavy 
usage of the hubs cannot be recovered from the increased 
usage of their contacts, as for these services the impact 
of persuasion is negligible. 

In summary, while informational externalities, captur- 
ing the spread of information about a given service be- 
tween acquaintances, are expected to play an important 
role in the adoption of innovations, their role is dimin- 
ished in the saturated regime, where awareness of the 
benefits of a given service arc widely known. We would 
expect, therefore, that social network effects should cease 
to correlate with service usage. Here we provide direct 
evidence of the contrary: for all services social network 
affects can be detected, even in the stationary state. The 
magnitude of this effect is service-dependent, however, 
leading to the coexistence on the same social network 



of two distinct classes of services. Wc find that services 
characterized by strong dyadic and community effects are 
driven by network externalities, making the use of a ser- 
vice more valuable for a user if it is used by its acquain- 
tances as well. Most importantly, we find that successful 
interventions into service usage require a detailed under- 
standing of the interplay between the structure of the 
underlying communication network and the usage mech- 
anism. There is no one-size fits all advertising strategy: 
for a given innovation some strategies are guaranteed to 
fail, while strategies based on the quantitative knowledge 
of the spreading mechanism are more likely to succeed. 
Finally, our approach offers a coherent framework for un- 
derstanding service usage in an environment where both 
the social network and service usage can be monitored, 
potentially opening new avenues for research into social 
diffusion and human dynamics. 
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FIG. 1: Characterizing service adoption patterns, (a— d) The number of users for each service in two week intervals (circles, 
continuous), indicating that usage has reached a stationary value. The cumulative number of users (squares, dashed) represent 
all those who have used the service at least once since the beginning of our observational period, (e— h) The fraction of users 
at distance d from a service user (blue, continuous) or non-user (green, dashed), averaged over 100 randomly chosen users 
(non-users). The horizontal line indicates the expectation based on random adoption (pi). (i-1) Adoption rates in function of 
the user degree k. Filled symbols stand for call data aggregated over a one-month period, while empty ones denote a one-month 
period 10 months later. The solution of Eq. valid for Email and WAP, is shown as a solid line. 
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FIG. 2: A local neighborhood of the communication network of telephone contacts, (a), (b) In both figures each node 
corresponds to an individual (mobile phone number), two individuals being connected if they talked or sent short messages 
to each other during a one-month period. The network was drawn by selecting an individual, and displaying all nodes within 
distance four from her. To illustrate service usage on this network. Email users are colored red in (a), while in (b) Chat users 
are shown in red. Note that while Email users are scattered uniformly, Chat users form compact communities, (c— f) The 
degree-dependent adoption functions characterizing the adoption of services thanks to external influence {^k) and the influence 
of service users with direct contact to the user (Xk) (see Supporting Text). For each adoption function we show the results of 
two independent measurements (filled and empty symbols), made 10 months apart (the two intervals denoted by the upper 
indices in the legends), indicating that the functional form of the adoption functions is time-independent. The vertical shifts 
represent fiuctuations in the coverage (see Fig. la-d), and do not affect our results, (g— j) Adoption functions capturing the 
service suspension by isolated {rjk) or connected (^fc) users, again measured for two different time intervals. 



8 




FIG. 3: Predicted and measured adoption rates, (a-d) The growtli of the adoption rates (fraction of all users that use the 
specific service) as predicted by the integration of Eqs. (1)~(2). The adoption rate of isolated (connected) users is shown as 
triangles (squares), and the sum of the two, corresponding to the total rate of adoption, as circles, (e-h) The predicted adoption 
rates for one month using Eqs. ©-([B} and the adoption functions in Fig. ^jp-j, with solid lines, and the actual measurements 
with symbols. The fraction of isolated users is shown as triangles, that of connected ones with squares, and the sum of the two 
with circles. 
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FIG. 4: The impact of advertising and targeted interventions on the number of service users, (a) The changes in the number of 
users after it was doubled by a short-term effect, such as advertising, for Email and Chat, (b) The gain in the number of users 
A in the stationary states (see e and /), normalized by the number of subscribers who have been offered permanent incentives 
{N{k > fcc)). The continuous curve corresponds to Chat, while the dashed one has the same parameters as Chat, but with 
Xk = 0. (c), (d) The number of service users for two services, for which the provider has boosted the number of users using 
advertising and incentives, (e) The changes in the number of adopters for Chat after hub-based incentives were initiated at 
t — 50, by decreasing in half rj^ and ^j, for hubs with degree over 5, 10, and 15, for the three curves from the top to bottom, 
respectively, (f) Number of adopters as in e, with the adoption function Xk set to at the beginning of the simulation, to 
mimic the impact of the lack of personal persuasion. 
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Supporting Text for "Network effects in service usage" 

Gabor Szabo and Albert-Laszlo Barabasi 



I. DATASETS 



Based on the call records of a mobile telephone opera- 
tor with residential subscribers, we performed our study 
on data comprising communication spanning 14 months, 
starting April 15, 2004. We neglected calls whose desti- 
nation or origin was outside the company, which left us 
with approximately 5.5 million nodes in the call network 
for each month. The links between nodes were defined by 
initiated voice calls and short messages (SMS), the two 
most common services with origin and destination in- 
formation. While these are directed by nature, we made 
the communication graph undirected, yielding more than 
10 million links for one month. The call data are aggre- 
gated into intervals of two weeks, each record showing the 
caller, the callee, the type of the communication, and its 
frequency during the two weeks. For call graphs, we accu- 
mulated two of the datasets together such that two users 
are linked if they communicated at least once during the 
two weeks (see Table H] for the time intervals). While 
the overwhelming majority of individuals only use voice 
calls for communications, it was possible to identify users 
of the studied services based on additional fields present 
in the datasets or by checking the callee field for unique 
service numbers that need to be dialed to use the service. 



^ ^ 




FIG. 5: Schematic representation of the mobile phone users' 
network. Users of a service are shown on the bottom. The 
estimation of the adoption functions /i^, 77^, A^, and Q. is 
based on two time windows of 4 weeks each with a 2-week 
overlap, represented by the two gray boxes (Tre/ and Tcomp, 
respectively). Users only in Tref but not in Tcomp allow us 
to determine the terms in Eqs. (|10p - (lll|l . responsible for the 
temporary suspension of the service, i.e. —rj for isolated dis- 
continuation and — ^ for connected discontinuation. Individ- 
uals who are users in Tcomp only (they were not users before 
Tcomp) are responsible for the gain in the rate equations, i.e. 
the /i and A terms for isolated and connected new users, re- 
spectively. 



II. MEASURING THE ADOPTION 
FUNCTIONS Atfc, r?fc, Afe, AND ^fc 

While Eqs. (1) and (2) are continuous in time, in or- 
der to be able to measure the adoption functions, we 
need to consider their discretized versions, given that the 
datasets for service usage are aggregated for periods of 
two weeks. Therefore, we write the discretized adoption 
equations as 



^^k {'-^comp : ^ref ) 



(9) 



C.+ 



{Xlur+f^l}nUTref)~anUTref) 



where we denoted with u\, u'j., n\, and nj: the measured 
values of the variables C/^, C/^, iV^, and iV^, respectively. 
Tref and Tcomp deuotc two time frames with data for both 
service usage and the communication map between the 
subscribers. The network of social contacts was measured 
with respect to Tref- two individuals were considered to 
be acquaintances if they called or sent a short message 
to each other in the Tref time period. An isolated ("«") 



individual was an individual without a service user in 
the immediate neighborhood in T^ef, while a connected 
( "c" ) individual had at least one user among its acquain- 
tances. A schematic representation of the measurement 
procedure is shown in Fig. O 

We first focus on Eq. pn]) . which describes the change 
in the number of isolated users with degree k during the 
Tcomp time interval, lS.u\,{T comp, Tref), where the refer- 
ence social network and usage data was taken from Tref ■ 
The change is the result of a gain in users due to isolated 
adoption {u]l^), and a loss of these through isolated ser- 
vice abandoning (u^' ), both of which can be experimen- 
tally measured: u]:^ is the number of new isolated users 

ref. 



(10) in Tcomp with degree k who were not users during T, 



and gives the number of isolated users in Tref who 
in turn abandoned the service in Tcomp- Since these pro- 
cesses are precisely what /i^ and 77^ aim to capture, we 
can determine the two adoption functions using 



Vk = 



KiTref) 



KiTrefV 



(11) 
(12) 
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where u\{Tref) and n\{Tref) are the number of isolated 
users and non-users with degree k in T^ef, respectively. 

Connected potential users in T^/ {n%{Tref)) are also in- 
fluenced to adopt by interpersonal communication, apart 
from external channels. Also, connected users {u'j^{Tref)) 
may also abandon the service, allowing us to measure the 
two other adoption functions and f^: 



nliTref) 
c, — 

uliTrefY 



(13) 
(14) 



xj!" is the measured average exposed degree for 
connected potential users, i.e. the average number of user 
neighbors for degree- fc connected individuals (note that 
Mfc" > a,nd it is approximated by 1 + Q{t){k — 1) in 
the rate equation (2). 

The observation periods used in this article are shown 
in TablelH both T^e/ and Tcomp span one month, and they 
overlap by two weeks. On one hand, the overlap was 
necessary, because we wished to make T^e/ and Tcomp 
long enough so that considerable adoption can be ob- 
served, and the longest consecutive time frame for which 
we have communication data is one and a half months. 
On the other hand, it demonstrates that the choice of T^e/ 
and Tcomp is arbitrary, as long as the adoption functions 
are understood to refer to the chosen time window span 
and the relative shift between Tcomp and Tref. There- 
fore, the adoption functions shown in Fig. 2c-j describe 
adoption and abandoning probabilities for individuals ob- 
served during one month, with respect to the next month 
two weeks into the future. The adoption rates also have 
to be measured accordingly, as has been done in Fig. 2c- j 
for Tref of the 2005 data (see also Table U). 

The interpretation of the adoption functions is the fol- 
lowing: an individual who at least once uses the ser- 
vice during the one-month observation period is called 
an user. If we were able to monitor service usage con- 
tinuously in time, the number of users at any point in 
time would follow a continuous curve. The number of 
users at time t, in this case, would give the number of 
individuals who used the service in the one-month period 
surrounding t. Since service usage is provided to us in 
two-week intervals, we can only sample this function at 
discrete intervals, such as T^cf and Tcomp- While at any 
point in time the adoption functions /ifc, rjk, Xk, and ^fc 
would be calculated as the slope of their respective func- 
tions giving the number of new users or abandonings, we 
are in effect approximating them by the slope of the line 
connecting the discrete sampled values. 



III. TIME INDEPENDENCE OF THE 
ADOPTION FUNCTIONS 

A fundamental assumption of our modeling framework 
is that the adoption functions fik^ A^, rjk and are time- 




FIG. 6; The degree distributions of the two call networks in 
the reference time windows of Fig. 2c-j, respectively. Data 
from April 2004 are shown in green, and those from February 
2005 in blue. Logarithmic binning was applied to the raw 
data, as shown by the larger dots. A function of the form 
A{k + ko) ' was fitted to the data points (thick red line), with 
parameters fco = 9.3 and 7 = 6.1. The distributions overlap 
to a high degree, indicating the time-independent nature of 
P{k). 



independent, and they characterize the user base's atti- 
tude to a specific innovation. Thus it is crucial to test 
if this time independence is indeed true in the available 
dataset. For this we have measured the adoption rates 
and the adoption functions independently for two differ- 
ent time intervals that are ten months apart from each 
other (see Table H]). The results of the measurements ob- 
tained for the first interval are shown as filled symbols in 
Fig. li-l and Fig. 2c-j, respectively, while those from the 
second interval are shown as empty symbols. As one can 
notice, the functional forms of the adoption curves re- 
main unchanged in time. The only noticeable systematic 
difference is the vertical shifts in the curves, while leaving 
the functional form unchanged. These vertical shifts are 
a simple consequence of the fluctuations in the service 
usage observed in Figs. la~d. These measurements thus 
validate our assumption that the only significant time 
dependence comes in the number of users, as described 
by Eqs. (l)-(2), and at those time scales the adoption 
bmctions can be taken to be time independent. 



IV. INTEGRATING THE RATE EQUATIONS 

Once the adoption functions have been measured, it 
was possible to predict the time evolution of the number 
of users by numerically integrating the rate equations 
(l)-(8), using the measured adoption functions /ifc, rjk, 
Afc, and ^k- As input we took the degree distribution of 
the social network at Tref , which is rather stable in time 
as Fig. [S] indicates. We set an initial condition in which 
we had no service users in the system at i = 0, i.e. Ul{t = 
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Adoption functions in Fig. 2c- j 


Tref 


Tcomp 


social network 


filled symbols 
empty symbols 


4/15/2004-5/15/2004 
2/15/2005-3/15/2005 


5/1/2004-6/1/2004 
3/1/2005-4/1/2005 


4/15/2004-5/15/2004 
2/15/2005-3/15/2005 



TABLE I; The observation periods chosen for the measurement of the adoption functions, as shown in Fig. 2c-]. The fourth 
column shows the period in which the network of social contacts was measured. 



0) = U^{t = 0) = iV^(t = 0) = 0, and Nl{t = 0) = Nk, 
where N^. denotes the number of individuals with degree 
k. We chose a sufficiently small value for the time step 
(At = 0.1), and by Euler integration we determined the 
adoption rates for the desired range in time. 



V. NUMERICAL SIMULATIONS 

The continuum theory (l)-(2) represents an approxi- 
mation to the real spreading process: it cannot account 
for all the details of the community formation process, so 
when services start to percolate (higher coverages) it will 
likely result in deviations from the real process. There- 
fore, to test the validity of the model, as well as the im- 
pact of advertisements (which in the short term result in 
high service adoption levels) we have carried out numer- 
ical simulations of the usage of the services on the social 
network of Tref in 2004. For this, we fitted power laws 
of the form Ak^ on the adoption functions of Fig. 2c- j 
using a least-squares fit, and used them as the empirical 
adoption functions in the simulations. For example, for 
Chat, the following fits were used (see Fig. 2c-j, straight 
lines) : 



Afe 
a- 



0.0013fc°-2942 
0.3751fc-°o"5 

0.0053fc°o"4 
0.3G38fc-°-2789^ 



We then initialized every individual in the network to be 
a non-user of the service, and we implemented a Monte- 
Carlo simulation using as input a close approximation 
to the empirically measured adoption functions. In ev- 
ery time step we updated each individual's service usage 
state by comparing the appropriate adoption function 
value to a randomly drawn number from a uniform dis- 
tribution between and 1. Thus for isolated (connected) 
users r^k (^fc), and for isolated (connected) potential users 
Mfc {^k) was used according to their degree. We used par- 
allel update to keep track of the changes in the states of 
individuals. 

To simulate the effect of campaigns and advertise- 
ments, upon reaching saturation at i = 50, we deter- 



mined the number of users in the system. We then 
changed the states of the same number of randomly cho- 
sen non-users so that they became users, thus simulating 
that the service usage was artifically increased by a fac- 
tor of two as a result of the campaigns. After this, we let 
the system relax and reach equilibrium again (Fig. 4a). 

To study the effect of targeted campaigns, at t = 50 we 
halved the adoption functions 77^ and for every person 
with a degree greater or equal to the preset critical degree 
fcc, thus simulating a permanent change in the behavior 
for these individuals: they would be less likely to abandon 
services than before. We then allowed the system to reach 
the stationary state again. 

In addition, where mentioned, we repeated the sim- 
ulations several times and averaged the time-dependent 
adoption rates or other quantities to minimize the effect 
of fluctuations. 



10^ 



CM ._ 

Q. 



r 



10' 




Email WAP Chat MyFriends 



FIG. 7: The probability pi that a link connects two service 
users, divided by the random expectation pi when every in- 
dividual in the system adopts with the same probability pi. 
Pi is the overall measured penetration of the service. 



